Variational Principle underlying Scale Invariant Social Systems 
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MaxEnt's variational principle, in conjunction with Shannon's logarithmic information measure, 
yields only exponential functional forms in straightforward fashion. In this communication we show 
how to overcome this limitation via the incorporation, into the variational process, of suitable dy- 
namical information. As a consequence, we are able to formulate a somewhat generalized Shannonian 
Maximum Entropy approach which provides a unifying "thermodynamic-like" explanation for the 
scale-invariant phenomena observed in social contexts, as city-population distributions. We confirm 
the MaxEnt predictions by means of numerical experiments with random walkers, and compare 
them with some empirical data. 



INTRODUCTION 

Scale-invariant phenomena are plentiful in Nature and 
in artificial systems. Illustrations can be encountered 
that range from physical and biological to technologi- 
cal and social sciences [T]. Among the latter, one may 
speak of empirical data from scientific collaboration net- 
works j2,, cites of physics journals [3], the Internet traf- 
fic [4] , Linux packages links [5] , popularity of chess open- 
ings [6] , as well as electoral results [H |8] , urban agglom- 
erations [HI HO] and firm sizes all over the world [TT]. 
These systems lack a typical size, length or frequency for 
observables under scrutiny, a fact that usually leads to 
power law distributions 

Px{x)-^l/x'+\ (1) 

with A > 0. The class of universality defined by A = 1, 
corresponding to the so-called Zipf's law (ZL) in the 
cumulative distribution or the rank-size distribution re- 
ceived much attention |5H5l[5HT5]. Maillart et al. [S] have 
found that links' distributions follow ZL as a consequence 
of stochastic proportional growth. Such kind of growth 
assumes that an element of the system becomes enlarged 
proportionally to its size fc, being governed by a Wiener 
process. The A — 1— class emerges from a condition of 
stationarity (dynamic equilibrium) |15| . ZL also applies 
for processes involving either self-similarity |6j or fractal 
hierarchy |13j . all of them mere examples amongst very 
general stochastic ones [13]. The instance A = is rep- 
resentative of a second universality-class following Costa 
Filho et al. [7], who studied vote-distributions in Brazil's 
electoral results. Therefrom emerge multiplicative pro- 
cesses in complex networks [8]. A = 0— behavior ensues 
as well in i) city-population rank distributions JJjj, ii) 
Spanish electoral results [TB], and ui) the degree distri- 
bution of social networks [TT]. As shown in Ref. [T5] . 
this universality class encompasses Benford's Law [T^ . 
In the present vein, still another kind of idiosyncratic 
distribution is often reported: the log-normal one [20] . 
that has been observed in biology (length and sizes of 



living tissue [21]), finance (in particular the Black Sc- 
holes model [H]), and firms-sizes. The latter instance 
obeys Gibrat's rule of proportionate growth [25, that 
also applies to cities' sizes. Remarkably enough, all these 
variegated and sometimes quite complex systems share a 
scale-free growth behavior. 

Together with geometric Brownian motion, there is a 
variety of models arising in different fields that yield 
Zipf's law and other power laws on a case-by-case ba- 
sis P [TUl [T51 [^5H27] . as preferential attachment [J and 
competitive cluster growth [H] in complex networks, used 
to explain many of the scale-free properties of social, 
technological and biological networks. For instance we 
may mention urban dynamics ^28J , opinion dynamics |29j , 
and electoral results [7] [3D] , that develop detailed realis- 
tic approaches. Ref. [31] is highly recommendable as a 
primer on urban modelling. Of course, the renormaliza- 
tion group is intimately related to scale invariance and 
associated techniques have been fruitfully exploited in 
these matters (as a small sample see [?7l 13^ ). 
Can the celebrated maximum entropy principle of Jaynes' 
(MaxEnt) [33l [34] explain the disparate phenomena re- 
counted above? At first sight, one faces a seemingly 
unsurmountable difficulty. MaxEnt's variational prin- 
ciple, in conjunction with Shannon's logarithmic infor- 
mation measure, can not in straightforward fashion yield 
power laws (nor more involved combinations that include 
power laws) , but only exponential distributions [34j . This 
fact constituted one major motivation [3S] for introduc- 
ing Tsallis' information measure [33] (and its associated 
Tsallis-MaxEnt treatment ^7\). One immediately gets 
power laws thereby as a result of varying the measure. 
We have shown in Ref. [38] how to overcome the above 
Shannonian limitation via suitable incorporation, into 
the MaxEnt principle, of dynamical information. As a 
consequence, we are now able to formulate a Shannonian 
Maximum Entropy approach |33LI34| common to aH these 
systems. As stated in [38], this technique exhibits the pe- 
culiarity of including "equations of motion" as a part of 
the required a priori knowledge which always needs to be 
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incorporated into the accompanying Jaynes' variational 
problem. The desired goal can in this way be successfully 
achieved, which provides a unifying "thermodynamic- 
like" explanation for the above mentioned disparate phe- 
nomena. The approach explicitly reconciles two apparent 
different viewpoints, those attached to i) growth models 
and ii) information-treatments |39) . 



THEORETICAL PROCEDURE 
General approach 

Accordingly, consider a rather general stochastic dy- 
namical equation for the observable x, namely. 



x(t) — k g[x{t)]; g arbitrary. 



(2) 



where k represents the derivative of a Wiener process 
(characterized by stationary independent increments) 
that frequently occurs in economic and social systems. 
We assume that there exists an appropriate transforma- 
tion of X that makes ^ invariant and a hallmark of the 
symmetry characterizing the system. Thermodynamics 
and many areas of physics have been shown by Frieden 
and Soffer [40j to be typified by translational symmetry 
if they are theoretically described a la Fisher [JD]. This 
entails g{x) — 1, which makes Eq. ^ the emblematic 
equation for linear Brownian motion. Our considerations 
will revolve around a new variable u — u{x) such that 
dx/du — g{x), a variable that 

1. should linearize the dynamic equation, 

2. make the original x— symmetry a translational one 
u{t) = k, and 

3. according to the "central hypothesis" of Ref. [55] . 
constitutes the tool for introducing dynamical 
information into MaxEnt by working with the 
M— entropy of the system. 

One has EE 



S[pu{u)]^ I dupu{u)\og[pu{u% (3) 
Jn 

with $1 an appropriate "volume" in u— space. The equi- 
librium probability density pu{u) is derived from Jaynes' 
MaxEnt principle [33] 



S[pu{u)]~Y,\{f,(u)) Uo, 



(4) 



where the mean values of the functions fi{u) describe 
the a-priori known constraints on the system, while 
are the associated Lagrange multipliers. Our constraints 
represent conservation rules, operating on our system. 



that strongly condition the configuration of the equilib- 
rium state. The general solution of the Jaynes problem 
is ^ 



pij(u)du — exp 



du 



(5) 



which, changing back to x lead us to a px{x) of the form 

dx 



px{x)dx = exp 



E 



Aj/j(m(x)) 



9{x) 



(6) 



The main difference with the usual MaxEnt (ME) solu- 
tion is the Jacobian du/dx — l/g(x). This looks trivial 
enough, but the Jacobian contains dynamical informa- 
tion, otherwise absent from the treatment. In a manner 
of speaking, we are thereby "extending" the exponential- 
like form of the Jaynes' ME-solutions to other analytical 
possibilities. 



Scale-invariant systems 

To illustrate this procedure we consider the emblematic 
equation used in many models of mathematical finance 
(e.g., the Black-Scholes model [22]) and cities' and firms' 
sizes (Gibrat's law [23]), and also many of the "social" 
examples listed above. We speak then of the so-called 
geometrical Brownian motion: 



x{t) — kx{t), 



(7) 



which is symmetric under scale transformations x' — cx 
with c an arbitrary constant. We are here defining a 
proportional growth or multiplicative process. A sys- 
tems following such dynamics can be described by our 
approach. We set u = log(a;/a;o), where xq is the min- 
imum allowed value for x (or a "reference" -one). The 
Jacobian of the transformation is du/dx — and the 
volume in u space is defined as O: [0 < u < um]- Here 
um = ^og(xM I Xq) corresponds to some maximum allow- 
able value (which can be infinity). When no constraints 
are included and an infinite volume f2 is considered, the 
constituents of system diffuse as random walkers in u. 
Accordingly, the density distribution (DD) pij{u)du is a 
gaussian distribution that becomes a log-normal one in 
X. This no-constraint case has been widely studied, and 
most city-population distributions follow it [23] . 

Let us discuss now just how adding constraints af- 
fect the equilibrium DD by considering the simple form 
fi{u) = M*. The case i — corresponds to normaliza- 
tion of the DD, while i — 1 refers to having (u) as a 
constraint. We tackle the variational solution subjected 
to these two constraints, setting /i = Ao and A = Ai. 
Accordingly, the equilibrium DD pu{u) extremizes the 
functional F = S — fj, — A(u), so that 



Pu{u)du = e 



du. 



(8) 
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The values for /z and A are obtained from the conservation 



rules 1 - . 
yield {u) — 
of X, becomes 



: 1/A if UM 



CO. 



The DD, 



-M-Atiy which 
as a function 



Px{x)dx 



dx 



(9) 



px{x)dx 



(10) 



i.e., a power law whose exponent is characterized by A, 
as that presented in Eq. ([T]). Look first at the particular 
solution A = (no constraint on {u)). One is led to 
e'^ = um and (m) = um/'2.. In terms of x the DD is 

1 dx 

Um X 

As remarked above, such DD is related to Benford's Law. 
Using a thermodynamic analogy, the law describes the 
simple scenario of a non-interacting system confined to 
a finite volume of w-space, with a Gaussian distribution 
for ii, i.e., a scale- free ideal gas (SFIG)p^. Even if the 
normalization seems to diverge here for um — oo, it can 
be kept finite in going to the thermodynamic limit (see 
[IB]): if N is the total number of system's "elements", 
its density p{x) = Npx{x) is normalized in the limit 
{N, Um oo) if N/um Poj where po is a constant. Let 
us pass now to the particular solution fi = (relaxing the 
normalization constraint). If m^/ — >■ oo, we obtain from 
the conservation rules A = 1 and (u) = 1. Accordingly, 
the DD for the observable x is 



px{x)dx 



dx 

X'^ 



(11) 



i.e., Zipf's law (ZL). In a thermal context, the absence 
of normalization can be understood as an inability of the 
system to reach the thermodynamic limit. One may then 
speak of a Zipf regime [16]. The lack of normalization 
constraint is discussed also in Refs. [31 [14]. An expla- 
nation may be concocted: these elements might be dis- 
tributed on the surface of an appropriate volume [31 116j . 
Accordingly, ZL usually holds for the upper tail and not 
for the "bulk" of the associated distribution. There is 
a free interchange of elements between the two regions 
with no "energetic cost" since the "chemical potential" 
p is zero. Accordingly, the number of elements does 
not remain constant (photon statistics). This number- 
fluctuation has no effect in the bulk region where the 
thermodynamic limit is reached, but it totally determines 
how the density behaves on our putative "surface" . 



PRESENT RESULTS: NUMERICAL 
EXPERIMENTS AND EMPIRICAL 
OBSERVATIONS 

So as to confirm the preceding MaxEnt predictions 
we have performed numerical experiments with ran- 
dom walkers and compared them with empirical city- 
population data. We solve numerically the equation 
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FIG. 1: Top panel: Evolution of a random walkers' distri- 
bution without constraints from an initial delta-one (purple 
solid line), passing through intermediate stages (red dotted, 
green dashed, and solid blue lines). Inset: logarithmic size u 
vs. relative growth u (linear fit in black). Bottom panel: 
Florida's cities-population-distribution for 1990, 2000, and 
2010 (same color-code as TP) compared with a normal distri- 
bution (smooth solid blue line). Inset: same as in top panel's 
inset. 



x{t) = kx{t), discretizing the time in intervals of At. k 
is randomly generated at each iteration from a Gaussian 
distribution with zero mean and variance K. 



Free evolution 

We first study evolution without constraints (as a con- 
trol case), starting with N ~ 10000 walkers located at 
the same position x = 100 {u = 2 log 10), that evolve 
with K — 10 and a time interval At = 10~^ in x. We 
also include a drift, as in Ref. [23]. As expected, we ob- 
tain the diffusion process depicted in upper panel of Fig. 
1, which is a log- normal DD in x (and a Gaussian in u). 
We now compare such evolution with that of the cities- 
population of Florida State using Census Bureau's data 
for the years 1990, 2000, and 2010 [41], also finding 
a log-normal in x with growing width (1.64, 1.65 and 
1.72, respectively). The proportional growth condition is 
checked by computing the correlation between the log- 
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arithm of the population of each place u with its rel- 
ative change u, using two-points formulas for the time 
derivative. For u = k both observables are independent 
for scale-free systems. The correlation value is 0.027, 
small enough to confirm that geometrical motion is tak- 
ing place (see inset of Fig. 1, bottom panel). Even if the 
populations of Florida-cities do not seem to match the 
non-interacting assumption of our simulation, we expect 
a short-range nature for the correlation between the pop- 
ulation of each pair of cities |32], relative to the State's 
size. According to our thermodynamic interpretation, we 
can think of a dilute scale-free gas at zero pressure and 
expanding freely. 



Evolution constrained by normalization and finite 
volume 



For an example with normalization constraint and fi- 
nite volume (A = 0), we arbitrarily define a;o = 1, 
xm — iC, {um — 4 log 10) and use the same initial 
conditions as in the precedent case. Now, a "move" is 
not accepted if the new position falls outside the appro- 
priate region. After some iterations the system reaches 
equilibrium, as shown in Fig. 2, top panel. We find a 
nice fitting of this equilibrium distribution to that pre- 



dicted by MaxEnt [Eq. (10)], confirming the validity of 
our approach. 

A system in this thermodynamical condition 

1. obeys the proportional growth dynamics, 

2. exhibits low correlation between its elements. 




Rank 



FIG. 2: Top panel: walkers' distribution for the first text- 
example A = (origin of Benford Law). The blue solid line 
indicates convergence towards the MaxEnt-prediction (black 
solid line). Inset: equilibrium rank-distribution. Bottom 
Panel: rank-distribution of Marshall Islands' city-population 
versus i) MaxEnt-prediction for A = (black line) and ii) a 
log- normal (red line). Inset: logarithmic size u vs. relative 
growth ii (linear fit in black). 



3. conserves the particle-number, and 

4. is characterized by an objective, measurable size- 
constraint. 

This last condition, for city-populations, has the form 
of a geographical constraint. Such is the case of, e.g., 
the Marshall Islands: this particular geographical area 
covers 181.3 km^ distributed into 29 atolls and 5 islands. 
Traversing the sea may reduce correlations between cities 
as compared to terra firma. The migration pattern con- 
centrates in the two main population-centers, Rita and 
Ebeye, so that only indirect correlations are expected 
between the rest of the cities. We verified this issue by 
recourse to data from 1980, 1988, and 1999 [43]. The 
islands' logarithmic population u, exhibits a correlation 
coefficient of 9 x 10^'^ with a relative increment ii, con- 
firming the dynamics' nature (inset of Fig. lb, bottom 
panel) . We fitted the raw data for all low-correlated cen- 
ters (154 populations) to the MaxEnt prediction and also 
to a log-normal with the same log-mean and log-variance 
that characterize the data (Fig. lb, bottom panel). We 
found a correlation of 0.991 in the former case and of 



0.979 in the latter one. One thus may visualize the Mar- 
shall Islands as a closed-volume scale- free ideal ga (SFIG) 
in thermal equilibrium, again empirically confirming the 
validity of our approach. 



Evolution constrained by a mean value-condition 

The second example presented here (/i = 0) uses a simple 
algorithm that reproduces Zipf's Law. N walkers "un- 
fold" while guaranteeing the conservation laws' proper 
working: (log(a;/a;o)) = (m) = 1 (or Yld=i^og{x,/xo) = 

T^^=i '^i — -^)- We start all the walkers at a; = e x a;o 
(any distribution with {u) = 1 is adequate) and proceed 
iterating as follows: 



1. select an arbitrary walker and change its position 
in proportional fashion according to x{t + At) = 
x{t) X (1 + kAt), with a random value k; 

2. assume that we have a bulk reservoir so that the 
exchange of elements has no 'energetic cost'. We 
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randomly select a second walker and remove him 
from his position x'; 

3. add a new walker at a position that preserves the 
operating conservation rule, that is x'{t + At) — 
x'{t) X (1 - fcAt). 

We iterate these steps till convergence is achieved. After 
some iterations, we do converge to the predicted Max- 
Ent distribution Eq. (11) (Fig. 3, top panel, with the 
same values for xq, iV, K, and Ai used in the first exam- 
ple), thusly Once more, we reconfirm that the approach 
presented here works in reasonable fashion. Remarkably 
enough, a similar algorithm is also able to reproduce any 
arbitrary power law with exponent A + 1 by changing the 
value of the conservation rule in the fashion (u) = 1/A. 
For city-populations, elements-exchange occurs in a con- 
tinuous fashion. However the data are recorded at in- 
tervals of years, so that the ensuing exchange-effects are 
similar to those of our simulation. Areas belonging to the 
most-populated sites eventually change with time. Thus, 
we deal with an open system with no fixed number of 
elements, like our photon-gas above. As a well known 
example consider the most populated metropolitan USA- 
areas |15j . We have confirmed the proportional growth 
hypothesis using data from years 1990, 2000, and 2010 
[3T] to find a small correlation of 0.016 between u and 
ii (inset of Fig. Ic, bottom panel). Our first N = 150 
areas are in Zipf's regime (Fig. 3, bottom panel) and 
constitute the surface of the statistical system at hand. 
We have verified the prevailing conservation rule, find- 
ing X^ili log(a:i/a;i5o) = 145.7, 150.8, and 154.9 for each 
year, respectively, close to the MaxEnt prediction of 150. 
We have also verified that 10 of these areas (a 6.7%) that 
in 1990 pertained to the Zipf regime are not characterized 
by it in 2010. 
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FIG. 3: Top panel: walkers distribution for the second text- 
example /i = (Zipf's Law). Inset: equilibrium rank- 
distribution (sIope= 1 in black). Bottom panel: rank- 
distribution of USA's most-populated metropolitan areas for 
1990, 2000, and 2010; Inset: logarithmic size u vs. relative 
growth ii (linear fit in black). 



CONCLUSIONS 

Different phenomena involving scale-invariance can 
be unified via Jaynes' MaxEnt principle, provided 
that adequate dynamical information is suitably 
incorporated into the variational process, in the man- 
ner here prescribed. This allows one to perform a 
thermodynaniic-like description of social systems, re- 
lated to that of an ideal gas. 



ACKNOWLEDGMENT: This work was partially 
supported by ANR DYNHELIUM (ANR-08-BLAN- 
0146-01) Toulouse, project PIP1177 of CONICET, (Ar- 
gentina), project FIS2008-00781/FIS (MICINN), and 
project FEDER (EU) (Spain, EU). 



[1] M. Schroeder, Fractals, chaos and power laws (Freeman, 
NY, 1990). 

[2] M. E. J. Newman, Phys. Rev. _B 64, (2001) 016131. 
[3] A. Hernando et al., A. Plastino, Phys. Lett. A 374, 18 
(2009). 

[4] A.-L. Barabasi, R. Albert, Rev. Mod. Phys. 74, 47 (2002). 
[5] T. Maillart, et al., Phys. Rev. Lett. 101, 218701 (2008). 
[6] B. Blasius, R. Tonjes, Phys. Rev. Lett. 103, 218701 

(2009) . 

[7] R. N. Costa Filho, et al., Phys. Rev. £60, 1067 (1999). 
[8] A. A. Moreira, et al., Phys. Rev. E, 73, 065101(R) (2006). 
[9] L. C. Malacarne, R. S. Mendes, and E. K. Lenzi, Phys. 
Rev. E 65, 017106 (2001). 
[10] M. Marsili, Y. C. Zhang, Phys. Rev. Lett. 80, 2741 
(1998). 

[11] R. L. Axtell, Science 293, 1818 (2001). 

[12] G. K. Zipf, Human Behavior and the Principle of Least 

Effort (Addison- Wesley, Cambridge, Mass., 1949). 
[13] Y. Chen, Chaos SoUtons Fract. 45, 63 (2012). 
[14] B. C. Murtra, R.V. Sole, Phys. Rev. E 82, 011102 

(2010) . 

[15] X. Gabaix, Quart. J. Econom. 114, 739 (1999). 



6 



[16] A. Hernando, C. Vesperinas, A. Plastino, Physica A 389, 
490 (2010). 

[17] A. Hernando et al., Eur. Phys. J. B 76, 87 (2010). 

[18] Weisstein, Eric W., Benford's Law from MathWorld. 

[19] F. Benford, Proceed. Am. Phil. Soc. 78, 551572 (1938). 

[20] Wikipedia http: / / en.wikipedia.org/wiki /Log-normal 

[21] J. S. Huxley, (1932) Problems of relative growth (Methuen 
& Co. Lmtd., London, 1932). 

[22] S. M. Ross, Introduction to Probability Models, 9th edi- 
tion (Academis Press, NY, 2007). 

[23] J. Eeckhout, Ame. Eco. Rev. 94, 1429 (2004). 

[24] H. Rozenfeld, et al., Proc. Nat. Acad. Sci. 105, 18702 
(2008). 

[25] S. Ree, Phys. Rev. E 73 (2006) 026115. 
[26] W. J. Reed, B. D. Hughes, Phys. Rev. E 66 (2002) 
067103. 

[27] S. Galam, J. Stat. Phys. 61, 943 (1990). 
[28] UrbanSim: http://www.urbansim.org. 
[29] C. Castellano, S. Fortunato, V. Loreto, Rev. Mod. Phys., 
81, 591 (2009). 

[30] S. Fortunato, C. Castellano, Phys. Rev. Lett. 99, 138701 
(2007). 



[31] M. Batty, Cities and Complexity (MIT Press, Cambridge, 
MA, 2005). 

[32] S. Galam, Physica A 285, 66 (2000). 

[33] A. Katz, Principles of stat. mechanics (Freeman, San 
Francisco, 1967). 

[34] A. Katz, Principles of statistical mechanics (Freeman, 
San Francisco, 1967). 

[35] A. Plastino, Physica A 344, 608 (2004). 

[36] C. Tsallis, Introduction to Nonextensive Statistical Me- 
chanics (Springer, NY, 2009). 

[37] A. R. Plastino, A. Plastino, Phys. Lett. A 177, 177 
(1993). 

[38] A. Hernando, A. Plastino, A.R. Plastino, 
arXiv:1201.0889 and Eur. Phys. J. B (2012) in Press. 

[39] H. Simon, Biometrika 42, 425 (1955). 

[40] B.R. Frieden, B.H. SoflFer, Phys. Rev. E 52, 2274 (1995). 

[41] Census bureau. Government of USA (web). 

[42] G. Krings et al., J. Stat. Mech, L07003 (2009). 

[43] Econom. Pol. Plann. & Stat. Office, Rep. of the Marshall 
Islands (web). 



